Oracle Evaluation of Flexible Adaptive Transforms for Underdetermined Audio Source Separation

نویسندگان

  • Andrew Nesbit
  • Mark D. Plumbley
  • Emmanuel Vincent
چکیده

We describe and apply a flexible, adaptive cosine packet transform to separate audio sources from instantaneous, underdetermined audio mixtures by time-frequency masking. Previously studied adaptive transform schemes have two main drawbacks: the signal can only be partitioned into dyadic intervals, and the profiles of the overlapping windows are often very short, thus tapering off very quickly. The novel aspects of our new approach are that it admits a much larger library of admissible orthogonal bases, and thus does not require dyadic segmentation and alleviates border artifacts at window boundaries. Oracle estimation, which determines experimental upper performance bounds of our techniques, demonstrates potential performance improvements of up to 3.0 dB SDR, when compared with fixed-basis transforms such as the short-time Fourier transform and modified discrete cosine transform, and the previously studied adaptive cosine packet decomposition scheme.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extension of Sparse, Adaptive Signal Decompositions to Semi-blind Audio Source Separation

We apply sparse, fast and flexible adaptive lapped orthogonal transforms to underdetermined audio source separation using the time-frequency masking framework. This normally requires the sources to overlap as little as possible in the time-frequency plane. In this work, we apply our adaptive transform schemes to the semiblind case, in which the mixing system is already known, but the sources ar...

متن کامل

A Study of the Effect of Source Sparsity for Various Transforms on Blind Audio Source Separation Performance

In this paper, the problem of blind separation of underdetermined noisy mixtures of audio sources is considered. The sources are assumed to be sparsely represented in a transform domain. The sparsity of their analysis coefficients is modelled by the Student t distribution. This prior allows for robust Bayesian estimation of the sources, the mixing matrix, the additive noise variance as well as ...

متن کامل

First Stereo Audio Source Separation Evaluation Campaign: Data, Algorithms and Results

This article provides an overview of the first stereo audio source separation evaluation campaign, organized by the authors. Fifteen underdetermined stereo source separation algorithms have been applied to various audio data, including instantaneous, convolutive and real mixtures of speech or music sources. The data and the algorithms are presented and the estimated source signals are compared ...

متن کامل

On the Use of Latent Mixing Filters in Audio Source Separation

In this paper, we consider the underdetermined convolutive audio source separation (UCASS) problem. In the STFT domain, we consider both source signals and mixing filters as latent random variables, and we propose to estimate each source image, i.e. each individual sourcefilter product, by its posterior mean. Although, this is a quite straightforward application of the Bayesian estimation theor...

متن کامل

A Signal-adaptive Local Cosine Transform for Source Separation by Time-frequency Masking

Time-frequency masking is often used for source separation of underdetermined audio mixtures. It depends on the fact that the sources can be represented disjointly in some transform domain. The focus of this paper is on demixing sources from instantaneous, two-channel mixtures by binary masking. We investigate trees of local cosine bases from which a suitable transform may be generated—the best...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008